Design Framework of a Database for Structured Documents with Object Links
Authors
Abstract
Structured documents often contain character strings whose semantics can be naturally stored as database values or correspond directly to database values. By building bidirectional logical links between character strings in documents and the corresponding database values, semantically rich queries become expressible. We have introduced a new ADT, named “paratext,” to model text that has links to database values. A paratext is logically viewed as consisting of two parallel layers: the “appearance” layer holds ordinary text (i.e., a linear sequence of character strings), while the “reference” layer holds an array of OIDs and literals. Each OID or literal on the reference layer is associated with a contiguous substring of the appearance-layer text and represents the semantics of that substring. We have also designed domain-specific functions for this document model. Using these functions, we can express queries that go back and forth between the two layers. In structured documents, such linked character strings can make up the whole content of a logical element or appear as phrases inside one. We also present frameworks for implementing the paratext ADT and discuss how traditional full-text indexing techniques can be extended to support paratext.
Key words: structured document, database, hypertext, abstract data type
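The two-layer view described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the class names `OID`, `Link`, and `Paratext`, the methods `link`, `ref_at`, and `substrings_for`, and the rule that linked substrings may not overlap are all assumptions made for illustration. The sketch only shows one way an appearance string and a reference array of OIDs and literals could be tied together, with one lookup in each direction.

```python
from dataclasses import dataclass, field
from typing import List, Optional, Union


@dataclass(frozen=True)
class OID:
    """Hypothetical object identifier for a database object."""
    value: int


# A reference-layer entry is either an OID or a literal value.
Ref = Union[OID, str, int, float]


@dataclass(frozen=True)
class Link:
    start: int  # inclusive offset into the appearance text
    end: int    # exclusive offset into the appearance text
    ref: Ref    # semantics of the substring appearance[start:end]


@dataclass
class Paratext:
    appearance: str                                      # "appearance" layer
    reference: List[Link] = field(default_factory=list)  # "reference" layer

    def link(self, start: int, end: int, ref: Ref) -> None:
        """Associate a contiguous substring with an OID or literal."""
        if not (0 <= start < end <= len(self.appearance)):
            raise ValueError("span lies outside the appearance text")
        # Assumption of this sketch: linked substrings never overlap.
        if any(start < l.end and l.start < end for l in self.reference):
            raise ValueError("span overlaps an existing link")
        self.reference.append(Link(start, end, ref))
        self.reference.sort(key=lambda l: l.start)

    def ref_at(self, pos: int) -> Optional[Ref]:
        """Appearance -> reference: what does the character at pos denote?"""
        for l in self.reference:
            if l.start <= pos < l.end:
                return l.ref
        return None

    def substrings_for(self, ref: Ref) -> List[str]:
        """Reference -> appearance: which substrings denote this value?"""
        return [self.appearance[l.start:l.end]
                for l in self.reference if l.ref == ref]


text = "Sales rose in Tokyo during 1995."
doc = Paratext(text)
city = text.index("Tokyo")
year = text.index("1995")
doc.link(city, city + len("Tokyo"), OID(42))  # OID 42 is a made-up city object
doc.link(year, year + len("1995"), 1995)      # a literal on the reference layer
```

Here `ref_at` goes from a text position to the database value behind it, and `substrings_for` goes from a value back to the text. A linear scan keeps the sketch short; a real system would index both directions.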
Similar resources
An Extensible Schema-less Database Framework for Managing High-throughput Semi-Structured Documents
An object-relational database management system is an integrated hybrid cooperative approach that combines the best practices of the relational model, utilizing SQL queries, and the object-oriented, semantic paradigm for supporting complex data creation. In this paper, a highly scalable, information-on-demand database framework, called NETMARK, is introduced. NETMARK takes advantage of the Oracle...
Design and Development of a Web Database of Information Resources on Indicators for Monitoring and Evaluating Science, Technology and Innovation
So far, many indicators for evaluation of science, technology and innovation have been presented in various documents in Iran. Also, many indicators have been mentioned in the reports of international organizations. Selection and use of the indicators is difficult for policy makers and researchers because of the abundance and distribution of them in various domestic and international documents ...
Klemens Böhm Building a Configurable Database Application for Structured Documents
Storing structured documents in object-oriented databases and fragmenting them according to their logical structure gives way to more expressive querying mechanisms, as compared to conventional document-management systems. At a first stage, however, such an approach does not come out too good with regard to the performance of certain other basic operations. Thus, our database-application framewor...
Publishing RDF from Relational Database Based on D2R Improvement
As a key technology for implementing the Semantic Web, linked data has gradually become an academic and industrial concern. Linked data represents a practice of technologies on the web and linked structured data. The goal of linked data is to enable people to share structured data on the web as easily as they can share documents today. On the Web of data structured with linked data, users can jump from o...